Integrity Protocols for Recovering from Distributable Real-Time Thread Failures with Assured Timeliness in Dynamic Systems
Abstract
We consider the problem of recovering from failures of distributable threads with assured timeliness in dynamic systems with overloads, node failures, and (permanent/transient) network failures. When a distributable thread encounters a failure that prevents its timely execution, the thread must be terminated. Thread termination involves detecting and aborting thread orphans, and delivering exceptions to the farthest, contiguous surviving thread segment for possible execution resumption. Thread termination operations must optimize system-wide timeliness. We present a scheduling algorithm called HUA and two thread integrity protocols called D-TPR and W-TPR. We show that they bound the orphan cleanup and recovery time with bounded loss of the best-effort property, i.e., high-importance threads are always favored over low-importance ones (for feasible completion), irrespective of thread urgency. Our implementation experience using the emerging Reference Implementation of Sun’s Distributed Real-Time Specification for Java demonstrates the effectiveness of the algorithm and protocols.
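The termination step described in the abstract (find the orphans, then pick the farthest contiguous surviving segment to receive the exception) can be illustrated with a small sketch. This is not the D-TPR or W-TPR protocol; it is a minimal, centralized model under stated assumptions: a distributable thread is represented as an ordered chain of segments starting at the root, failures are given as a set of node names, and the names `Segment` and `classify_segments` are hypothetical.

```python
from dataclasses import dataclass
from typing import List, Optional, Set, Tuple


@dataclass(frozen=True)
class Segment:
    """One hop of a distributable thread (hypothetical model, not from the paper)."""
    index: int  # position in the invocation chain; 0 is the root segment
    node: str   # name of the node hosting this segment


def classify_segments(
    chain: List[Segment], failed_nodes: Set[str]
) -> Tuple[Optional[Segment], List[Segment]]:
    """Return (exception_target, orphans) for a chain ordered from the root.

    exception_target: the last segment reachable from the root before the
    first failed node, or None if there is no failure or the root itself failed.
    orphans: segments past the first break that sit on still-live nodes
    (assumption: segments on failed nodes are simply lost).
    """
    last_alive: Optional[Segment] = None
    for position, segment in enumerate(chain):
        if segment.node in failed_nodes:
            # Everything downstream of the break is cut off from the root.
            orphans = [s for s in chain[position + 1:] if s.node not in failed_nodes]
            return last_alive, orphans
        last_alive = segment
    return None, []  # no break found, so nothing to recover


# Usage: a thread spanning nodes A -> B -> C -> D, with node C failed.
chain = [Segment(0, "A"), Segment(1, "B"), Segment(2, "C"), Segment(3, "D")]
target, orphans = classify_segments(chain, {"C"})
print(target, orphans)
```

In this model the segment on B would receive the exception and the segment on D would be aborted as an orphan. A real protocol has to reach the same decision in a distributed way and within bounded time, which this sketch does not attempt.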
Similar resources
Assured-Timeliness Integrity Protocols for Distributable Real-Time Threads with in Dynamic Distributed Systems
Networked embedded systems present unique challenges for system designers composing distributed applications with dynamic, real-time, and resilience requirements. We consider the problem of recovering from failures of distributable threads with assured timeliness in dynamic systems with overloads, and node and (permanent/transient) network failures. When a distributable thread encounters a fail...
Fast Scheduling of Distributable Real-Time Threads with Assured End-to-End Timeliness
We consider networked, embedded real-time systems that operate under run-time uncertainties on activity execution times and arrivals, node failures, and message losses. We consider the distributable threads abstraction for programming and scheduling such systems, and present a thread scheduling algorithm called QBUA. We show that QBUA satisfies (end-to-end) thread time constraints in the presen...
Enhancing the Adaptivity of Distributed Real-time and Embedded Systems via Standard QoS-enabled Dynamic Scheduling Middleware
To support the dynamically changing QoS needs of open distributed real-time embedded (DRE) systems, it is essential to propagate QoS parameters and to enforce task QoS requirements adaptably across multiple endsystems dynamically in a way that is simultaneously efficient, flexible, and timely. This paper makes three contributions to research on QoS-enabled middleware that supports these types o...
Scheduling Distributable Real-Time Threads in Middleware
We present the Tempus real-time middleware. Tempus supports Real-Time CORBA 2.0’s distributable threads (DTs) as an end-to-end programming abstraction for distributed real-time systems. DTs in Tempus can have time constraints, including time/utility functions (TUFs), can have resource constraints, particularly mutual exclusion, and can be scheduled according to utility accrual (UA) disciplines....
Exploiting Slack for Scheduling Dependent, Distributable Real-Time Threads in Mobile Ad Hoc Networks
We consider scheduling distributable real-time threads with dependencies (e.g., due to synchronization) in mobile ad hoc networks, in the presence of node/link failures, message losses, and dynamic node joins and departures. We present a distributed real-time scheduling algorithm called RTG-DS. The algorithm uses a gossip-style protocol for discovering eligible nodes, node/link failures, and mes...